AITopics

2511.13341

Country: Asia > China (0.14)

Genre:

Workflow (0.69)
Research Report (0.50)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Software (1.00)
Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Arani, Ali Kazemi, Le, Triet Huynh Minh, Zahedi, Mansooreh, Babar, M. Ali

Systematic Literature Review on Application of Learning-based Approaches in Continuous Integration

arXiv.org Artificial Intelligence, Jul-2-2024

Context: Machine learning (ML) and deep learning (DL) analyze raw data to extract valuable insights in specific phases. The rise of continuous practices in software projects emphasizes automating Continuous Integration (CI) with these learning-based methods, while the growing adoption of such approaches underscores the need for systematizing knowledge. Objective: Our objective is to comprehensively review and analyze existing literature concerning learning-based methods within the CI domain. We endeavour to identify and analyse various techniques documented in the literature, emphasizing the fundamental attributes of training phases within learning-based solutions in the context of CI. Method: We conducted a Systematic Literature Review (SLR) involving 52 primary studies. Through statistical and thematic analyses, we explored the correlations between CI tasks and the training phases of learning-based methodologies across the selected studies, encompassing a spectrum from data engineering techniques to evaluation metrics. Results: This paper presents an analysis of the automation of CI tasks utilizing learning-based methods. We identify and analyze nine types of data sources, four steps in data preparation, four feature types, nine subsets of data features, five approaches for hyperparameter selection and tuning, and fifteen evaluation metrics. Furthermore, we discuss the latest techniques employed, existing gaps in CI task automation, and the characteristics of the utilized learning-based techniques. Conclusion: This study provides a comprehensive overview of learning-based methods in CI, offering valuable insights for researchers and practitioners developing CI task automation. It also highlights the need for further research to advance these methods in CI.

application, international conference, ml model, (14 more...)

2406.19765

Country:

Asia (0.04)
Oceania > Australia > South Australia > Adelaide (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(3 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (1.00)

Industry: Information Technology (1.00)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Mining (1.00)
(5 more...)

arXiv.org Artificial Intelligence, May-16-2024

Automating the Training and Deployment of Models in MLOps by Integrating Systems with Machine Learning

Liang, Penghao, Song, Bo, Zhan, Xiaoan, Chen, Zhou, Yuan, Jiaqiang

This article introduces the importance of machine learning in real-world applications and explores the rise of MLOps (Machine Learning Operations) and its role in solving challenges such as model deployment and performance monitoring. By reviewing the evolution of MLOps and its relationship to traditional software development methods, the paper proposes ways to integrate systems with machine learning to address the problems facing existing MLOps practice and improve productivity. It focuses on the importance of automated model training and on using a version control system to keep the training process transparent and repeatable. In addition, the challenges of integrating machine learning components into traditional CI/CD pipelines are discussed, and solutions such as versioning environments and containerization are proposed. Finally, the paper emphasizes the importance of continuous monitoring and feedback loops after model deployment to maintain model performance and reliability. Using case studies and best practices from Netflix, the article presents key strategies and lessons learned for successful implementation of MLOps practices, providing a useful reference for other organizations building and optimizing their own MLOps practices.
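The abstract's point about repeatable training through version control can be illustrated with a minimal sketch. It assumes a training run is fully described by a code version, the training data, and the hyperparameters; the function name `training_run_fingerprint` and the sample values are hypothetical and do not come from the paper.

```python
import hashlib
import json


def training_run_fingerprint(code_version: str, data_bytes: bytes, params: dict) -> str:
    """Return a short, deterministic ID for one training run (hypothetical helper)."""
    payload = {
        "code_version": code_version,
        "data_sha256": hashlib.sha256(data_bytes).hexdigest(),
        "params": params,
    }
    # sort_keys makes the serialization independent of dict insertion order
    canonical = json.dumps(payload, sort_keys=True).encode("utf-8")
    return hashlib.sha256(canonical).hexdigest()[:12]


# Same code, data, and hyperparameters (in a different key order) give the same ID.
run_a = training_run_fingerprint("abc123", b"x,y\n1,2\n", {"lr": 0.01, "epochs": 5})
run_b = training_run_fingerprint("abc123", b"x,y\n1,2\n", {"epochs": 5, "lr": 0.01})
# Changing a single data value gives a different ID.
run_c = training_run_fingerprint("abc123", b"x,y\n1,3\n", {"lr": 0.01, "epochs": 5})

print(run_a == run_b)  # True
print(run_a == run_c)  # False
```

Storing such an ID next to each trained model is one possible way to tell whether two runs used identical inputs; a real setup would more likely rely on a dedicated data and model versioning tool.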

deployment, integration, mlop, (17 more...)

2405.09819

Country:

North America > United States > New York (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > United States > California > Santa Clara County > San Jose (0.04)
(4 more...)

Genre: Research Report (0.50)

Industry: Information Technology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Arani, Ali Kazemi, Le, Triet Huynh Minh, Zahedi, Mansooreh, Babar, Muhammad Ali

Systematic Literature Review on Application of Machine Learning in Continuous Integration

arXiv.org Artificial Intelligence, Jul-17-2023

This research conducted a systematic review of the literature on machine learning (ML)-based methods in the context of Continuous Integration (CI) over the past 22 years. The study aimed to identify and describe the techniques used in ML-based solutions for CI and analyzed various aspects such as data engineering, feature engineering, hyper-parameter tuning, ML models, evaluation methods, and metrics. In this paper, we have depicted the phases of CI testing, the connections between them, and the techniques employed in the training phases of the ML methods. We presented nine types of data sources and four steps taken in the selected studies to prepare the data. We also identified four feature types and nine subsets of data features through thematic analysis of the selected studies, and we show five methods for selecting and tuning hyper-parameters. In addition, we summarised the evaluation methods used in the literature and identified fifteen different metrics. The most commonly used evaluation metrics were found to be precision, recall, and F1-score, and we have also identified five methods for evaluating the performance of trained ML models. Finally, we have presented the relationship between ML model types, performance measurements, and CI phases. The study provides valuable insights for researchers and practitioners interested in ML-based methods in CI and emphasizes the need for further research in this area.
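As a reminder of how precision, recall, and F1-score are computed, here is a minimal sketch using the standard definitions. The build-outcome labels are invented for illustration and are not data from the reviewed studies; `precision_recall_f1` is a hypothetical helper name.

```python
def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall, and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    # Guard against division by zero when a class is never predicted or never occurs
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Hypothetical CI build outcomes: 1 = build failed, 0 = build passed
actual = [1, 0, 1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 1, 0, 1, 0]

precision, recall, f1 = precision_recall_f1(actual, predicted)
print(precision, recall, f1)  # 0.75 0.75 0.75
```

Here three failing builds are predicted correctly, one passing build is wrongly flagged, and one failing build is missed, so precision and recall are both 3/4.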

artificial intelligence, machine learning, systematic literature review, (2 more...)

2305.12695

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligence, Apr-7-2023, 00:25:56 GMT

Build Reliable Machine Learning Pipelines with Continuous Integration

As a data scientist, you are responsible for improving the model currently in production. After spending months fine-tuning the model, you discover one with greater accuracy than the original. Excited by your breakthrough, you create a pull request to merge your model into the main branch. Unfortunately, because of the numerous changes, your team takes over a week to evaluate and analyze them, which ultimately impedes project progress. Furthermore, after deploying the model, you identify unexpected behaviors resulting from code errors, causing the company to lose money.

pipeline, remote storage location, workflow, (11 more...)

Technology:

Information Technology > Data Science (0.87)
Information Technology > Artificial Intelligence > Machine Learning (0.71)

Arani, Ali Kazemi, Zahedi, Mansooreh, Le, Triet Huynh Minh, Babar, Muhammad Ali

SoK: Machine Learning for Continuous Integration

arXiv.org Artificial Intelligence, Apr-5-2023

Abstract: Continuous Integration (CI) has become a well-established software development practice for automatically and continuously integrating code changes during software development. An increasing number of Machine Learning (ML) based approaches for automation of CI phases are being reported in the literature. It is timely and relevant to provide a Systemization of Knowledge (SoK) of ML-based approaches for CI phases. Our systematic analysis also highlights the deficiencies of the existing ML-based solutions that can be improved for advancing the state-of-the-art. Given the variety of employed techniques in applying ML solutions in CI, and growing interest in this domain, it is necessary to systematically identify state-of-the-art practices used for automating CI tasks through ML methods. In recent years, the software development industry has seen a significant shift towards the adoption of Continuous Integration.

algorithm, artificial intelligence, machine learning, (13 more...)

2304.02829

Country:

Oceania > Australia > South Australia > Adelaide (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Nebraska > Lancaster County > Lincoln (0.04)
(2 more...)

Genre:

Overview (0.94)
Research Report > Experimental Study (0.46)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

#artificialintelligence, Feb-25-2023, 11:25:35 GMT

Automating Machine Learning Pipelines with CI/CD/CT: A Guide to MLOps Best Practices

MLOps, short for Machine Learning Operations, is an emerging practice that brings together the disciplines of machine learning and DevOps to streamline the entire lifecycle of machine learning models, from development to deployment and beyond. One of the key aspects of MLOps is the use of automation to improve the efficiency, reliability, and quality of machine learning pipelines. In this tutorial, we will explore how to use Continuous Integration (CI), Continuous Delivery (CD), and Continuous Testing (CT) to automate the deployment of machine learning models. Before we dive into the details of MLOps automation, let's briefly explain the three key concepts that underpin it. MLOps automation typically involves a series of steps that automate the entire machine learning pipeline, from data preparation to model deployment. To automate this process, we can use a combination of CI/CD/CT tools and techniques.
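The idea of an automated pipeline running from data preparation to deployment can be sketched as follows. This is a toy illustration under stated assumptions, not the tutorial's code: the "model" is simply the mean of the data, the test is a mean-absolute-error threshold, and every function name is hypothetical.

```python
def prepare(raw):
    """Data preparation: drop records with missing values."""
    return [r for r in raw if r is not None]


def train(data):
    """Toy 'training': the model is just the mean of the data."""
    return sum(data) / len(data)


def passes_tests(model, data, max_error):
    """Automated test step: mean absolute error must stay within max_error."""
    mae = sum(abs(x - model) for x in data) / len(data)
    return mae <= max_error


def run_pipeline(raw, max_error):
    """Run prepare -> train -> test, and 'deploy' only if the test passes."""
    data = prepare(raw)
    model = train(data)
    deployed = passes_tests(model, data, max_error)
    return {"model": model, "deployed": deployed}


raw_records = [1.0, None, 2.0, 3.0]
print(run_pipeline(raw_records, max_error=1.0))  # {'model': 2.0, 'deployed': True}
print(run_pipeline(raw_records, max_error=0.5))  # {'model': 2.0, 'deployed': False}
```

The design point is that deployment is a consequence of the test step rather than a manual decision; in practice each function would be a job in a CI/CD tool.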

automate, automating machine learning pipeline, mlop automation, (8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligence, Sep-28-2022, 14:05:54 GMT

Automate Model Deployment with GitHub Actions and AWS

This article was published as a part of the Data Science Blogathon. In a typical software development process, deployment comes at the end of the software development life cycle. First, you build software, test it for possible faults, and finally deploy it so end users can access it. The same can be applied to machine learning as well. In a previous article, I described how we could build a model, wrap it with a REST API, containerize it, and finally deploy it on cloud services.

github action, task definition, workflow, (12 more...)

Technology:

Information Technology > Data Science (0.70)
Information Technology > Software Engineering (0.59)
Information Technology > Artificial Intelligence > Machine Learning (0.55)

#artificialintelligence, Feb-21-2022, 15:15:22 GMT

MLOps & Machine Learning Pipeline Explained - Medi-AI

MLOps is a compound term that combines "machine learning" and "operations." The role of MLOps, then, is to provide a communication conduit between data scientists who work with machine learning data and the operations team that manages the project. To do so, MLOps applies the practices used for cloud-native applications in DevOps to machine learning (ML) services, specifically continuous integration/continuous deployment (CI/CD). Although both ML and normal cloud-native apps are written in (ok, result in) software, there is more to ML services than just code. While cloud-native apps require source version control, automated unit and load testing, A/B testing, and final deployment, MLOps uses a data pipeline, ML model training, and more complex deployment with special-purpose logging and monitoring capabilities.

deployment, machine learning pipeline explained, ml service, (9 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligence, Feb-1-2022, 10:45:30 GMT

5 ways machine learning uses CI/CD in production

Continuous integration (CI) is the practice of developers merging their code changes into a central repository many times a day. Continuous delivery (CD) is a fully automated software release process. The two terms are not interchangeable, but together CI/CD is a DevOps methodology. A continuous integration/continuous delivery (CI/CD) pipeline is a system that automates the software delivery process: it builds code, runs tests, and delivers new product versions whenever the software changes.
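The build, test, and deliver sequence can be sketched as a small stage runner that stops at the first failure. This is only an illustration of the control flow under simplifying assumptions: each stage is a hypothetical callable returning `True` on success, whereas a real pipeline would invoke build tools and test suites.

```python
from typing import Callable, List, Tuple

Stage = Tuple[str, Callable[[], bool]]


def run_stages(stages: List[Stage]) -> List[Tuple[str, str]]:
    """Run stages in order; once one fails, mark the remaining ones as skipped."""
    report = []
    failed = False
    for name, action in stages:
        if failed:
            report.append((name, "skipped"))
            continue
        ok = action()
        report.append((name, "passed" if ok else "failed"))
        failed = not ok
    return report


# Hypothetical stages: the test stage fails, so delivery never runs.
pipeline = [
    ("build", lambda: True),
    ("test", lambda: False),
    ("deliver", lambda: True),
]
print(run_stages(pipeline))
# [('build', 'passed'), ('test', 'failed'), ('deliver', 'skipped')]
```

Skipping later stages after a failure is what keeps a broken change from being delivered.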

ci cd, delivery, pipeline, (16 more...)

Industry: Information Technology (0.52)

Technology:

Information Technology > Software Engineering (0.75)
Information Technology > Artificial Intelligence > Machine Learning (0.71)
Information Technology > Data Science > Data Integration (0.62)